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Abstract 

Using ChIP Seq, we identified 556 and 467 putative STAT6 target sites in the Burkitt's lymphoma cell 
line Ramos and in the normal lung epithelial cell line BEAS2B, respectively. We also examined the pos- 
itions and expression of transcriptional start sites (TSSs) in these cells using our TSS Seq method. We 
observed that 44 and 132 genes in Ramos and BEAS2B, respectively, had STAT6 binding sites in proximal 
regions of their previously reported TSSs that were up-regulated at the transcriptional level. In addition, 
406 and 109 of the STAT6 target sites in Ramos and BEAS2B, respectively, were located in proximal 
regions of previously uncharacterized TSSs. The target genes identified in Ramos and BEAS2B cells in 
this study and in Th2 cells in previous studies rarely overlapped and differed in their identity. 
Interestingly, ChIP Seq analyses of histone modifications and RNA polymerase II revealed that chromatin 
formed an active structure in regions surrounding the STAT6 binding sites; this event also frequently 
occurred in different cell types, although neither STAT6 binding nor TSS induction was observed. The 
rough landscape of STAT6-responsive sites was found to be shaped by chromatin structure, but distinct 
cellular responses were mainly mediated by distinct sets of transcription factors. 
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1. Introduction 

The signal transduction pathway initiated by IL-4 
stimulation is involved in various biological phenom- 
ena, such as Th2 cell differentiation, immunoglobulin 
class switching in B cells and inflammatory responses 
in epithelial cells. 1-3 Malfunctions of the IL-4 signal- 
ling pathway cause improper responses in these cell 
types that subsequently result in chronic allergic dis- 
eases such as asthma and atopic disorders. 4,5 For 
example, in asthma, IL-4 is excessively produced by 
Th2 cells and induces differentiation and class switch- 
ing of allergen-specific B cells towards IgE pro- 
duction. 6,7 At the same time, IL-4 also initiates 
response cascades in various cell types, such as eosi- 
nophils, basophils, mast cells and epithelial cells. 8 In 



particular, CCL1 1 (eotaxin-1) synthesis is enhanced 
in epithelial cells, 9,10 and mucus-producing goblet 
cells become hyperactive and differentiate. 11 
Reflecting the pivotal roles of the IL-4 pathway in 
allergic responses, knockout mice deficient in signal- 
ling molecules, such as the IL-4 receptor and STAT6, 
show symptoms that resemble those of allergic dis- 
orders. 12-14 

In various cell types, the transcription factor (TF) 
STAT6 is a common downstream effector of the IL-4 
signalling pathway. When bound by IL-4, IL-4 receptor 
a is phosphorylated by JAK kinases and recruits STAT6. 
The JAK kinases phosphorylate STAT6, induce its 
dimerization and facilitate its translocation to 
nucleus, where it acts as a transcriptional activator. 15 
Several dozen STAT6 target genes have been 
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identified, such as IL-4, 16 RETNLB, 1 7 SOCS1 , 1 8,1 9 
CD23A 20 and CCL11. It has also been shown that 
the target sites of STAT6 have a consensus sequence: 
TTCNNNNGAA. However, a comprehensive view of 
the target genes of STAT6 remains elusive. In particu- 
lar, previous studies mainly searched for STAT6 target 
genes in T cells 21,22 ; target genes in other cell types 
(such as B cells or epithelial cells) have not been 
fully characterized. 

Recently developed massively parallel sequencing 
technologies coupled with chromatin immunopreci- 
pitation (ChIP Seq) have enabled comprehensive ana- 
lyses of binding sites for TFs. 23,24 Recent papers have 
reported the comprehensive identification of STAT6 
target genes in Th2 cells in both mice and 
humans. 22 ChIP Seq analysis has also been used to 
comprehensively assess chromatin status by monitor- 
ing changes in histone modification patterns. 25-27 
Indeed, for many genes in various cell types, epige- 
netic changes have been shown to play important 
roles in transcriptional regulation in response to cellu- 
lar environmental changes. Moreover, we also devel- 
oped a method to enable large-scale analysis of 
transcriptional start sites (which we termed TSS Seq) 
by combining our full-length cDNA technology 
(oligo-capping) with massively parallel sequen- 
cing. 28,29 In this method, a sequence adaptor necess- 
ary for the sequencing reaction is introduced into the 
mRNA as a cap-replacing oligo so that the region 
immediately downstream of the TSS can be 
sequenced (TSS tag). 30 We demonstrated that the 
position of a TSS and its transcriptional level can be 
analysed at the same time by digital counting of the 
TSS tags. Integrative analysis of epigenomic and tran- 
scriptomic data collectively revealed that the human 
gene transcriptome and its regulation are truly 
dynamic and depend on the cell type. Every cell 
seems to have a unique chromatin status, harbouring 
its own set of active promoters, and a large amount of 
transcription occurs even from intergenic 
regions. 31,32 These results again suggest that it is 
essential to identify and characterize STAT6 target 
genes and their regulation in a wider variety of cell 
types to obtain a more complete view of the biological 
relevance of the IL-4 signalling pathway. 

In this paper, we characterized STAT6 target genes 
and the chromatin status of the target sites by using 
ChIP Seq to evaluate STAT6, RNA polymerase II (pol 
II) and histone modifications. We further combined 
the ChIP Seq data with TSS Seq data to monitor the 
transcriptional consequences of STAT6 binding. We 
identified and characterized 44 putative STAT6 
target genes in the Ramos cell line, which is a 
Burkitt's lymphoma cell line, and 132 in the BEAS2B 
cell line, which is a normal lung epithelial cell line. 
We also compared the STAT6 target genes identified 



in these cell types with recently reported STAT6 
target genes in human Th2 cells. These results collec- 
tively illustrate that different cell types have their own 
mode of STAT6- mediated transcriptional regulation. 
Here, we report that STAT6 has diverse target gene 
repertoires depending on cell type. 

2. Materials and methods 

2.1. Sequence data 

The short-read sequence archive data appearing in 
this paper are registered in GenBank/DDBJ under 
the following accession numbers: SRA008161, 
SRA008162, SRA008163, SRA008164, 

DRA000017, DRA00001 8, DRA000019, 

DRA000020, DRA000021, DRA000022, 

DRA000046, DRA000047, DRA000048, 

DRA000049, DRA000050, DRA000051, 

DRA000052, DRA000053, DRA000054, 

DRA000055, DRA000056, DRA000057, 

DRA000058, DRA000059, DRA000326, 

DRA000327, DRA000328, DRA000329, 

DRA000330, DRA000331, DRA000332, 

DRA000333, DRA000334, DRA000335, 

DRA000336, DRA000337, DRA000338, 

DRA000339, DRA000340 and DRA000341. 

2.2. Cell culture 

Cells of the human B lymphoma line Ramos were cul- 
tured in RPMI1640 medium (Nissui) supplemented 
with 10% fetal bovine serum, 0.3 g/l L-glutamine 
(GIBCO), 0.36% bicarbonate and 60 mg/l kanamycin 
(GIBCO). Human lung epithelial BEAS2B cells were 
maintained in Dulbecco's modified Eagle's medium: 
Nutrient Mixture F-1 2 (GIBCO) with 1 0% fetal bovine 
serum, 1 00 U/ml penicillin — 1 00 (xg/ml streptomycin 
(GIBCO) and 1 |xg/ml puromycin (BD Biosciences). 
These cells were cultured at 37°C in a 5% CO2 incuba- 
tor. Ramos and BEAS2B cells were stimulated by 
50 ng/ml recombinant rhlL-4 (BD Biosciences). 
Ramos cells were stimulated 1 0 |xg/ml human Anti- 
IgM (BETHYL). RNA interference was accomplished by 
transfecting cells with siGENOME SMARTpool STAT6 
siRNA and siGENOME Non-Targeting siRNA #1 
(Dharmacon). Short oligo-RNAs (50 nM) were trans- 
fected using 2 |J/ml Dharmafect 1 transfection 
reagent (Dharmacon) as recommended by the 
manufacturer. 

2.3. Construction of the ChIP Seq library 

For each IP, 5 x 1 0 7 cells were stimulated with rhlL- 
4 at 3 7°C for 30 min. Cells were fixed with a 0.5% 
formaldehyde solution (50 mM 4-(2-hydroxyethyl)- 
1 -piperazineethanesulfonic acid (HEPES)-KOH, pH 
7.5, 100 mM NaCl, 1 mM ethylendiaminetetraacetic 
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acid (EDTA), 0.5 mM ethylene glycol tetraacetic acid 
(EGTA) and 5.5% formaldehyde) at room temperature 
for 1 0 min. Glycine (1 50 mM) was added to quench 
the cross-linking. Cells were rinsed twice with cold- 
Phosphate Buffered Saline (PBS) and harvested. Each 
cell pellet was resuspended in 5 ml of Lysis Buffer 1 
(50 mM HEPES-KOH pH 7.5, 140 mM NaCl, 1 mM 
EDTA, 10% glycerol, 0.5% NP-40 and 0.25% Triton 
X-1 00). The cell lysate was rocked at 4°C for 1 0 min 
and spun at 1 500 rpm for 5 min at 4°C. The super- 
natant was discarded. Each cell pellet was resus- 
pended in 5 ml of Lysis Buffer 2(10 mM Tris-HCl, 
pH 8.0, 200 mM NaCl, 1 mM EDTA and 0.5 mM 
EGTA). The cell lysate was rocked at room temperature 
for 1 0 min and spun at 1 500 rpm for 5 min at 4°C. 
The supernatant was discarded. Each cell pellet was 
resuspended in 1 ml of Lysis Buffer 3 (1 0 mM Tris- 
HCl, pH 8.0, 1 00 mM NaCl, 1 mM EDTA, 0.5 mM 
EGTA, 0.1% Na-deoxycholate and 0.5% N-lauroylsar- 
cosine). The cell lysate was sonicated with 1 0 cycles 
of 30 s in ice water with a sonicator (TOMY SEIKO). 
Triton X-1 00 (1 00 (xl of a 1 0% solution) was added 
to the sonicated cell lysate. The cell lysate was spun 
at 14 000 rpm for 1 0 min. The supernatant was 
transferred to a new tube, and 50 |xl of the super- 
natant was saved as whole cell extract (WCE)-DNA. 
The WCE-DNA was stored at -20°C The supernatant 
was transferred to 1 00 |jlI of magnetic beads (Dynal) 
that were bound to 1 0 (xg of antibody. The samples 
were rotated at 4°C overnight and washed eight 
times with 1 ml of Wash Buffer and once with TE 
buffer containing 50 mM NaCl. Elution Buffer 
(200 |jlI) was added and eluted at 65°C for 1 5 min. 
The samples were spun down, and the supernatant 
was transferred to a new tube. The samples were incu- 
bated at 65°C overnight. WCE-DNA (50 |xl) was 
thawed, and 1 50 |xl of Elution Buffer was added to 
it. The samples were incubated at 65°C overnight. TE 
(200 |xl) was added to the IP and WCE-DNA 
samples. RNase (8 |xl of 10mg/ml; Funakosi) was 
added. The samples were incubated at 37°C for 2 h, 
and 4 julI of 20 mg/ml proteinase K (Takara) was 
added. The samples were incubated at 55°C for 2 h. 
Phenol chloroform extraction and ethanol precipi- 
tation were performed. Samples destined for ChIP 
Seq by lllumina GAIIx were prepared according to 
the manufacturer's instructions. Anti-STAT-6 antibody 
(Santa Cruz, M-20), anti-RNA pol II antibody (Abeam, 
ab81 7), anti-H3K4me3 antibody (ab1 01 2) and anti- 
H3Ac antibody (Millipore, 

06-599) were used for the indicated experiments. 

2.4. Construction of the TSS Seq library 

Cells were seeded 48 h before stimulation. At 24 h 
after seeding, the cells were transfected with STAT6 



targeting and control siRNA and cultured for 24 h 
for the indicated experiments. At 24 h after transfec- 
tion, the transfected cells were stimulated with IL-4 
and control 0.1% Bovine serum albumin (BSA) PBS 
solution. At 24 h after stimulation, the cells were har- 
vested, and RNA was extracted using an RNeasy Kit 
(Qiagen). Fifty micrograms of the obtained total 
RNA was subjected to oligo-capping with some modi- 
fications of the original protocol; BAP-TAP-treated 
total RNAs were ligated with 1.2 (xg of RNA oligo 
(5'- AAUGAUACGGCGACCACCGAGAUCUACACUCUU 
UCCCUACACGACGCUCUUCCGAUCUGG-3'). After the 
DNase I treatment (TaKaRa), polyA + RNA was 
selected using oligo-dT powder (Collaborative). First- 
strand cDNA was synthesized from 1 0 pmol of 
random hexamer primer (5'-CAAGCAGAAGACGG 
CATACGAN N N N N NC- 3 ') using Super Script II 
(Invitrogen) at 12°C for 1 h and 42°C overnight. 
Template RNA was degraded by alkaline treatment. 
One-fifth of the first-strand cDNA was used as the 
PCR template. For the PCR, a Gene Amp PCR kit 
(PerkinElmer) was used with the PCR primers 5'- 
AATGATACGGCG ACCACCGAG- 3 ' and 5'-CAAGCAGA 
AGACGGCATACG-3' under the following reaction con- 
ditions: 1 5 cycles of 94°C for 1 min, 56°C for 1 min 
and 72°C for 2 min. The PCR fragments were size- 
fractionated by 12% polyacrylamide gel electrophor- 
esis, and the fraction at 1 50-2 50 bp was recovered. 
The quality and the quantity of the obtained single- 
stranded first-strand cDNA were assessed using a 
BioAnalyzer (Agilent). 

2.5. Deep sequencing and data processing of the ChIP 
Seq and TSS Seq libraries 

Approximately 1 ng of the size-fractionated cDNA 
was used for sequencing using an lllumina GAIIx. 
Statistics of the tags generated for each experiment 
are summarized in Tables 1-3 and Supplementary 
Fig. S1. The generated sequence tags were mapped 
onto the human genomic sequence (hg1 8 as of 
UCSC Genome Browser) using the sequence align- 
ment Programme Eland (lllumina). Unmapped or 
redundantly mapped sequences were removed from 
the data set, and only uniquely mapped sequences 
without any mismatches were used for further ana- 
lyses. Positions relative to RefSeq genes were calcu- 
lated based on the respective genomic coordinates. 
Genomic coordinates of exons and the protein- 
coding regions of the RefSeq transcripts are as 
described in hg1 8. 

For the ChIP Seq analysis, the binding sites of STAT6, 
RNA pol II and histone modifications were identified 
based on the short-read tag information as follows: 
the region encompassed by each mapped tag 
sequence was extended to 120 bp, which reflects 
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Table 1. Identification of STAT6 targets: summary of the ChIP Seq 
analyses of STAT6 and pol II 



ChIP (STAT6) ChIP (Pol II) 



Ramos 






# total reads 


1 9 099 520 


21 903 761 


# IL-4(+) peaks 


600 


1 7 431 


# IL-4(-) peaks 


73 


1 9 239 


BEAS2B 






# IP total reads 


1 8 981 052 


21 709 675 


# IL-4(+) peaks 


773 


1 3 457 


# IL-4(-) peaks 


314 


81 74 


The numbers of peaks detected usinj 


; the described par- 


ameters are shown. The numbers of peaks detected using 


different parameters are shown in Supplementary Fig. S1. 


Table 2. Identification of STAT6 targets: 


summary of TSS Seq 


analysis 








TSS Seq (IL-4 (+)) 


TSS Seq (IL-4 (-)) 


# Ramos total reads 


1 5 268 493 


1 5 759 41 5 


#BEAS2B total reads 


14 81 8 816 


1 1 628 747 


#BEAS2B (siSTAT6) 


14 1 22 877 


1 3 789 254 


Total reads 


44 21 0 1 86 


41 1 77 41 6 


Table 3. Identification of STAT6 targets: 


number of identified 


STAT6 binding sites in the indicated populations 




Ramos (pol II (+); 


BEAS2B (pol II (+); 




TSS (+)) 


TSS (+)) 


#total 


556 (50) 


467 (1 24) 


Promoter 


1 50 (30) 


358 (1 21) 


Alternative 


205 (1 2) 


33 (1) 


promoter 






Intergenic 


201 (8) 


76(2) 



The number of STAT6 binding sites at which pol II binding 
and TSS induction were simultaneously observed ('STAT6 
targets') is shown in parenthesis. 



the insertion sites of sample DNA fragments. For each 
genomic position, the number of overlapping 
extended tags was counted. Based on the calculated 
tag information, the sum of the included tags was 
evaluated to determine whether more than a 1 0- 
fold difference between the IP and the WCE was 
present. Genomic regions in which positive enrich- 
ment of the tags continued for more than 121 bp 
were then selected. The statistical significance for 
this selection procedure compared with the back- 
ground rate was evaluated using the Poisson probabil- 
ities as previously described in the reference. 24 

For the TSS Seq analysis, the TSS tags were clustered 
into 500-bp bins. Representative TSSs were selected 
as the position from which the largest number of 



TSS tags was mapped. The selected representative 
TSSs were used for further analyses. When the 
genomic coordinates of a representative TSS was 
located within -10 to +1 kb of the 5'-end of a 
RefSeq transcript model, it was associated with the 
RefSeq gene as the TSS of its authentic promoter. For 
digital TSS-tag count analysis, TSS-tag counts were 
normalized as parts per million tags and used for 
further comparisons. 

2.6. Other computational procedures 

For Gene Ontology (GO) term analysis, GO terms 
were associated with the identified STAT6 target 
genes using loc2go at NCBI. GO term enrichment 
was evaluated by calculating hypergeometric distri- 
butions. The statistical significance between the 
expression levels and their fold inductions in Ramos 
and BEAS2B cells was evaluated by the Wilcoxon 
signed-rank test. To analyse the gene expression 
changes of the TFs in Ramos and BEAS2B cells, 140 
TFs were selected in TRANSFAC (Rel. 2010.1). Fold 
changes of the expression levels in response to IL-4 
stimulation were evaluated by digital TSS-tag counts. 
The consensus sequences of the TFs were evaluated 
using MATCH 33 with cut-off values determined using 
minFP, which minimizes false-positive results. The 
enrichment of the detected putative binding sites in 
the active target and silent targets in Ramos and 
BEAS2B cells was evaluated by calculating hypergeo- 
metric distributions. Putative TF binding sites that 
were significantly enriched (P< 0.05) in the respect- 
ive groups were selected. 

2.7 . Gene-specific PCR 

For real-time PCR analysis of independent genes, 
template DNA was prepared using essentially the 
same protocol used for the short-read tag library con- 
struction. PCR primers were designed using Primer3 
(http://frodo.wi.mit.edu/primer3/). The PCR primers 
used in this study are shown in Supplementary Table 
S1. Semi-quantitative Reverse Transcription (RT)- 
PCR was performed using an ABI 7900HT (ABI). 

2.8. Electrophoresis Mobility Shift Assay (EMSA) 
BEAS2B cells were stimulated with rhlL-4 at 37°C 

for 30 min. Cells were suspended in 400 |xl of Buffer 
A (1 0 mM HEPES-KOH, pH 7.8, 1 0 mM KCl, 0.1 mM 
EDTA, 0.1% NP-40). Cells were spun at 5000 rpm for 
1 min at 4°C. The supernatant was discarded. Cell 
pellet was resuspended in IOOuJ of Buffer C 
(50 mM HEPES-KOH, pH 7.8, 420 mM KCl, 0.1 mM 
EDTA, 5 mM MgCl 2 , 2 0% Glycerol and 1 mM sodium 
orthovanadate). The cell lysate was rotated at 4°C 
for 30 min and spun at 1 5000 rpm for 1 5 min at 
4°C. The supernatant was used as nuclear extract. 
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Remaining procedures were performed using the 
LightSh ift Chemiluminescent EMSA kit (Thermo 
SCIENTIFIC). The DNA probe sequences used in this 
study are shown in Supplementary Table S1 . 

3. Results and discussion 

3.1 . Identification of putative STAT6 binding sites in 
Ramos and BEAS2B cells 

To identity STAT6 binding sites, we generated a total 
of 19 099 520 and 1 8 981 052 36-bp sequence 
ChIP Seq tags in Ramos and BEAS2B cells, respectively 
(Tables 1 -3). We selected genomic regions at which 
ChIP Seq tags were enriched at the estimated false 
detection rate of P< 5e-6 (assuming a Poisson distri- 
bution of background tags). We further selected the 
sites at which no such enrichment was observed in 
the absence of IL-4. In total, we identified 556 and 
467 STAT6 binding sites that were specifically 
detected in IL-4-stimulated Ramos and BEAS2B cells, 
respectively (Fig. 1A and B). The validation analysis 
using real-time RT-PCR for individual genes is pre- 
sented in Supplementary Fig. S1A and B. Details of 
the experimental and computational procedures are 
also described in the 'Material and methods' and in 
the Supplementary figures. See Supplementary Fig. 
S1C and D for results using different parameters for 
the computational target identification procedure 
and for the GenBank accession numbers of the 
short-read tag data sets. 

We analysed the genomic positions of the identified 
STAT6 binding sites (Fig. 1C and D). In BEAS2B cells, 
358 (77%) sites were located within -10 to +1 kb 
of a RefSeq gene region (the 5'-end of the most 
upstream RefSeq transcript model was designated as 
zero). In contrast, in Ramos cells, 1 50 (2 7%) of the 
binding sites were located in these regions, whereas 
37% were located in internal gene regions (Fig. 1 C). 
MEME analysis 34 of the binding sites in Ramos cells 
detected the consensus sequence TTCNNNNGAA, 
which matches previous studies. 35,36 However, a 
different consensus sequence, TCTCGCG, was 
detected in BEAS2B cells (Fig. 1 E). For the results of 
the validation analysis in the representative cases, 
see Supplementary Fig. S2. These results may indicate 
that STAT6 employs different modes of transcriptional 
activation in these cells. 

3.2. TSSSeq analysis and ChIP Seq analysis ofRNA pol II 
To directly analyse the transcriptional conse- 
quences of STAT6 binding, particularly in Ramos and 
BEAS2B cells, we constructed and analysed TSS Seq 
libraries using the same materials. We expected that 
TSS Seq analysis would allow us to associate the 
binding sites in intergenic regions and genie regions 
distant from the 5'-end of the authentic RefSeq 



transcript models with TSSs of previously uncharacter- 
ized transcripts. We generated a total of 31 02 7 908 
and 26 447 563 36 bp single-end read TSS tags in 
Ramos and in BEAS2B cells, respectively. Of these, 
22 227 238 (72%) and 20 964 221 (78%) of the 
tags mapped within the 5'-end exon of RefSeq genes 
and upstream proximal regions in Ramos and 
BEAS2B cells, respectively, indicating that the esti- 
mated accuracy of the TSS tags for determining TSS 
positions was ~86% (also see Supplementary Fig. 
S3C). The mapped tags were further clustered into 
bins of 500 to represent putative promoter groups 
(TSS clusters, TSCs). Consistent with our previous 
results, 28 although the numbers of the TSCs were 
generally large, those having TSS tags of > 1 ppm (cor- 
responding to one copy per cell, assuming every cell 
has 1 million mRNA copies) represented a very 
minor population (Supplementary Fig. S3D for stat- 
istics). In total, there were 33 1 96 and 41 359 TSCs 
with expression levels of >1 ppm in IL-4-stimulated 
Ramos and BEAS2B cells, respectively. To further 
ensure the validity of the TSS Seq analysis, we also 
analysed pol II binding sites by ChIP Seq. We generated 
and analysed 21 903 762 and 21 709 675 36-bp 
single-end read tags and identified a total of 1 7 431 
and 1 3 457 pol II binding sites with an estimated 
false detection rate of P< 0.001 (Tables 1-3) in 
Ramos and BEAS2B cells, respectively (Supplemen 
tary Fig. S1 E-G). Of these, ~63, 20 and 1 7% over- 
lapped TSC regions located in the 5'-ends, internal 
parts and intergenic regions, respectively, of the 
RefSeq genes. 

We associated the STAT6 ChIP Seq data with the TSS 
Seq data and the pol II ChIP Seq data. As summarized 
in Tables 4 and 5, in BEAS2B cells, among 358 puta- 
tive STAT6 binding sites that were identified within 
RefSeq gene regions, 1 32 (3 7%) had TSCs that were 
induced by at least 2-fold by IL4 stimulation and pol 
II binding sites in their proximal region (-10 to 
1 kb compared with the binding site). In nine cases, 
there were neither TSCs nor pol II binding signals in 
the surrounding regions, but we found them in 
other locations in the same RefSeq gene region. To 
determine whether these TSCs are associated with 
the STAT6 binding site, we conducted a TSS Seq analy- 
sis using 27 91 2 1 31 TSS tags in STAT6- knockdown 
BEAS2B cells (Table 2; also see Supplementary Fig. 
S3E-G). In 85% of the STAT6 target TSSs, the tran- 
scriptional induction indicated by digital TSS Seq-tag 
counts was repressed in STAT6 knockdown cells, indi- 
cating that the observed transcriptional induction at 
the associated TSCs was directly mediated by STAT6 
(Fig. 1 F). Altogether, we identified 44 and 132 
STAT6 target TSCs located at the 5'-ends of RefSeq 
genes, 45 and 9 TSCs previously uncharacterized 
TSCs (putatively alternative promoters of the same 
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Figure 1 . Identification of STAT6 target RefSeq genes. (A) Examples of identified STAT6 targets in authentic RefSeq promoter regions in Ramos cells 
(left panel) and BEAS2B cells (right panel). STAT6 bindingsites are indicated by red lines. Expression changes evaluated by digital TSS-tag counts are 
also shown in the bottom margin. (B) STAT6 targets in putative alternative promoter regions (left panel) and in intergenic regions (right panel) in 
Ramos cells. Upper and lower panels present the results from ChIP Seq and TSS Seq analyses, respectively. The IP used for each ChIP Seq analysis is 
shown in the margin. WCE, whole cell extract used as a background control. (C) Frequency of STAT 6 bindingsites relative to RefSeq regions in Ramos 
cells (left panel) and BEAS2B cells (right panel). (D) Distance between STAT6 binding sites and 5'-ends of RefSeq transcript models in Ramos cells 
(red bar) and BEAS2B cells (blue bar). (E) Consensus sequences deduced from STAT6 binding sites in Ramos cells (upper panel) and BEAS2B cells 
(lower panel). (F) Fold induction oftheSTAT6 target TSSs evaluated by digital TSS-tag counts in STAT6-knockdown BEAS2Bcells.Thei/-axis indicates 
the relative fold induction compared with that in wild-type BEAS2B cells. TSSs indicated by arrows showed reduced induction in knockdown cells. 
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Table 4. Epigenetic regulation of STAT6-mediated transcriptional 
activation: summary of ChIP Seq analyses of histone 
modifications 



ChIP (H3K4me3) 



ChIP (H3Ac) 



Ramos 

# total reads 

# IL-4(+) peaks 

# IL-4(-) peaks 
BEAS2B 

# IP total reads 

# IL-4(+) peaks 

# IL-4(— ) peaks 



1 30 1 35 923 
27 364 
26 1 28 

117 611 512 
24 281 
23 485 



1 23 883 1 44 
46 294 
49 953 

1 24 648 546 
37 480 
32 777 



gene) and 34 and 6 TSCs located in intergenic regions 
in Ramos and BEAS2B cells, respectively. Full lists of 
the identified STAT6 targets in the respective cat- 
egories are shown in Supplementary Table S2. For 
the remainder of the 435 and 320 STAT6 binding 
sites, there was no induced TSC associated with the 
binding site (see the following sections for detailed 
characterization). For the results of the similar ana- 
lyses using antigen-stimulated Ramos cells, see 
Supplementary Fig. S4A-E. 

3.3. Characterization ofSTAT6 targets 

We identified STAT6 targets in multiple categories 
(see also Supplementary Fig. S5 for other examples). 

3.3.7. protein-coding gene targets In Ramos and 
BEAS2B cells, 44 and 1 32 RefSeq genes were identified 



asSTAT6 target genes, respectively, for which binding of 
STAT6 to the 5'-end region was observed together with 
the presence and induction of TSC and pol II binding 
(Fig. 1 A). Previously reported target genes such as the 
SOCS1 in BEAS2B cells and low-affinity Fc receptor 
genes (CD23A) in Ramos cells are included, as are 
some uncharacterized genes. For example, the MS4A1 
gene (NM_021950) was identified as a STAT6 target 
gene in Ramos cells. MS4A1 is a human B-lymphocyte 
surface molecule, and in vitro studies with an MS4A1 
monoclonal antibody showed that this gene plays 
important roles in the regulation of B cell activation 
and proliferation. 37,38 Moreover, this antibody is clini- 
cally used as an anti-rheumatic drug. 39,40 
Identification of a STAT6-mediated pathway of MS4A1 
induction may present a target for the design of novel 
drugs. In general, GO term 41 enrichment analysis 
showed that GO terms associated with protein 
secretion and trafficking molecules were enriched 
among the STAT6 targets in Ramos cells. However, in 
BEAS2B cells, GO terms associated with transcription 
and chromatin remodelling were enriched (Fig. 2A). 
STAT6 may have different repertoires of targets in 
Ramos and BEAS2B cells. 

3.3.2. Putative alternative promoter targets In 
45 cases in Ramos cells and 9 cases in BEAS2B 
cells, STAT6 binding sites were found to be associ- 
ated with previously uncharacterized putative 
alternative promoters of protein-coding genes (left 
panel; Fig. 1 B). For example, a putative STAT6- 
responsive alternative promoter was identified in 



386 



Characterization of STAT Target Genes 



[Vol. 1 8, 



Table 5. Epigenetic regulation of STAT6-mediated transcriptional activation: the frequency of the STAT6 targets having the indicated 
signals are shown for the indicated populations 





Promoter 




Alternative promote 


r 


Intergenic 




Ramos 


BEAS2B 


Ramos 


BEAS2B 


Ramos 


BEAS2B 


#total 


1 50 


358 


205 


33 


201 


76 


H3K4me3 (+) 


1 24 (83%) 


327 (91%) 


1 9 (9%) 


3 (9%) 


1 6 (8%) 


7 (9%) 


H3Ac (+) 


131 (87%) 


321 (90%) 


91 (44%) 


2 (6%) 


1 05 (52%) 


1 0 (1 3%) 


Pol II (+) 


1 1 6 (77%) 


313 (87%) 


47 (23%) 


6 (1 8%) 


51 (25%) 


2 5 (3 3%) 


TSC induction (+) 


44 (29%) 


1 32 (37%) 


45 (22%) 


9 (2 7%) 


32 (16%) 


6 (8%) 





CO term 


<;oid 


P-valuc 




protein transporter activity 


00:0008565 


0.0019 


Ramos 


clathrin adaptor complex 


GO:003013I 


0.0024 
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GO:0033158 


0.0037 




platelet dense tubular network membrane 


GO:0031095 


0.0037 




ATP-depcndent chromatin remodeling 


GO:0043044 


5.2E-05 


BEAS2B 


nucleosome binding 


GO:003I491 


5.2E-05 




DNA bindinn 


GO:0003677 


1.2E-04 




protein binding 


GO:00055I5 


3.9E-07 
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GO:0008134 


1.6E-05 
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cell-cell adhesionremodcling 


GO:00 16337 


5.4E-05 
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Figure 2. Comparison of STAT6 target genes between Ramos and BEAS2B cells. (A) GO terms associated with the STAT6 target genes 
identified in Ramos cells (top panel), BEAS2B cells (middle panel) and Th2 cells (bottom panel). (B) Overlap of the STAT6 target 
genes in Ramos cells (red), BEAS2B cells (blue) and Th2 cells (green). (C) Expression levels (left panel) and fold induction (right 
panels) in response to IL-4 stimulation evaluated by digital TSS-tag counts. Statistical significance between the indicated populations 
is shown in the margin. STAT6 target genes in Ramos and BEAS2B cells. 



the second intron of the GAB1 gene 
(Supplementary Fig. S5). GAB1 is tyrosine-phos- 
phorylated upon stimulation by various cytokines, 
growth factors and antigen receptors. 
Phosphorylated GAB1 interacts with phosphatidyl- 
inositol 3-kinase (PI 3 K), and the N-terminal PH 
domain of GAB1 binds to 3-phosphoinositide 
(PI3). 42 Putative transcripts from the alternative 
promoter of GAB1 lack the PH domain, which may 
reduce PI3K lymphocyte activation via immunore- 
ceptors and cytokine receptors by competing as a 
PI3 docking-deficient isoform. 



3.3.3. Putative non-protein coding gene 
targets Among the STAT6 binding sites 
associated with intergenic TSCs, one TSC overlapped 
a previously identified putative ncRNA gene ('NR' 
gene) according to the UCSC Genome Browser. In 
one additional case, an intergenic TSC was associ- 
ated with a fully sequenced full-length cDNA, 
suggesting that STAT6 intergenic targets are mostly 
uncharacterized transcripts. 43-45 The right panel in 
Fig. 1 B presents one such case. Although the mol- 
ecular functions of these intergenic transcripts 
remain elusive, this is the first report to indicate 
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the possibility that STAT6 also regulates the 
expression of so-called putative non-coding RNA 
transcripts. 

3.4. Diverse regulation ofSTAT6 targets in different 
cell types 

We compared the STAT6 target genes identified in 
Ramos cells with those in BEAS2B cells and found 
that they did not overlap at all (Fig. 2B). Also, as 
shown in Figs 1 and 2, STAT6 target genes showed dis- 
tinct characteristics between Ramos and BEAS2B cells. 
Specifically, the target genes had different consensus 
binding sequences, different relative binding site pos- 
itions compared with the RefSeq region and different 
associated GO terms. We also included in the com- 
parison STAT6 target sites in human Th2 cells that 
were identified in a recent study. 22 The reported con- 
sensus binding sequence of TTCNNNNGAA in Th2 
cells was similar to the consensus sequence we ident- 
ified in Ramos cells. However, the reported target 
genes in Th2 cells very rarely overlapped with those 
in Ramos or BEAS2B cells. In contrast, the expression 
and induction levels of the STAT6 target genes in 
Ramos and BEAS2B cells were nearly equivalent 
(Fig. 2C). 

Having observed that different cells have different 
repertoires of STAT6 targets even when they are simi- 
larly activated, we speculated that this differential 
regulation of STAT6 activation may be mediated by 
distinct chromatin structures in distinct cell types. 
We examined changes in histone modification pat- 
terns in response to IL-4 stimulation, namely tri- 
methylation of lysine 4 (H3K4me3) and acetylation 
(H3Ac) of histone 3, using ChIP Seq analysis of a 
total of 496 279 125 36-bp single-end read tags 
(Table 4). These histone modifications are thought 
to be representative markers of active chromatin. 46 
As shown in Fig. 3A, we observed that H3K4me3 
and H3Ac were similarly enriched around the TSSs 
of STAT6 target genes (STAT6-positive, pol II binding- 
positive and TSS induction-positive genes) both in 
Ramos and BEAS2B cells, suggesting that active tran- 
scription occurs at the identified target sites. 
Interestingly, in both cell types, active histone 
markers were already present around the target sites 
prior to IL-4 stimulation. It is possible that, upon IL- 
4 stimulation, STAT6 is recruited to the chromatin 
that is already programmed to be bound by STAT6 
or other TFs (left panels; Fig. 3A and B; also see 
Fig. 4B for a schematic representation). 

We further examined the histone modification pat- 
terns in Ramos cells in regions that are STAT6 target 
regions in BEAS2B cells but not in Ramos cells (we 
denote these as 'silent targets' hereafter). 
Interestingly, active histone markers were also 



observed in Ramos cells even though these regions 
were silent in this cell type; neither STAT6 binding 
nor the presence of TSS tags was observed (right 
panels; Fig. 3A). Pol II recruitment was even observed 
in these regions (right bottom panel; Fig. 3A). 
Although they are in a cellular context in which no 
transcriptional induction occurs, these sites seem to 
be resting in a prepared state until STAT6 is recruited. 

3.5. Distinct modes of STAT6-mediated transcriptional 
induction in different cell types 

We also noticed that numerous STAT6 binding sites 
in Ramos cells did not consequently induce transcrip- 
tion even though STAT6 binding was clearly detected 
by ChIP Seq and confirmed by real-time PCR 
(Tables 4 and 5; also see Supplementary Fig. S6 for 
examples). We found that a majority of these STAT6 
binding sites also resided within H3K4me3- and 
H3Ac-positive regions and that pol II was frequently 
recruited. Thus, only the induction of the TSS was 
absent in these regions (Table 5). Similar observations 
were also made regarding putative alternative promo- 
ter binding sites of STAT6 and intergenic targets of 
STAT6. In these cases, some additional factors 
seemed to be missing, which hampered the full acti- 
vation of transcription. 

In BEAS2B cells, we also found a number of STAT6 
binding sites lacking transcriptional induction, which 
is similar to the case in Ramos cells (Table 5). 
However, in BEAS2B cells, active histone markers 
were found to be less prevalent around the silent 
target regions (STAT6 target sites in Ramos cells), 
and pol II was not bound in these regions (right 
panels; Fig. 3B). In the active target sites, active 
histone markers were already present prior to STAT6 
binding, but pol II recruitment occurred only after 
IL-4 stimulation (left panels; Fig. 3B). Results of a 
similar analysis for Th2 target regions in Ramos and 
BEAS2B cells are shown in Supplementary Fig. S7. It 
is possible that STAT6-mediated transcriptional acti- 
vation consists of multiple regulatory steps ranging 
from the formation of active chromatin to the recruit- 
ment of STAT6 and transcriptional activation, each of 
which requires different factors that may be cell- 
specific (Fig. 4B). 

3.6. Transcriptional regulation of Ramos and BEAS2B 
cells 

To search for factors missing in Ramos and BEAS2B 
cells that drive transcriptional activation in other cells, 
we analysed the transcriptional induction of genes 
encoding TFs in response to IL-4 stimulation by 
counting TSS tags. For this purpose, we selected 1 83 
previously functionally characterized TF genes from 
TRANSFAC. 47 As shown in Fig. 4A, -45 (25%) of the 
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TF genes were either up- or down-regulated in Ramos 
or BEAS2B cells. Although similar numbers of TF genes 
were up- or down-regulated, they rarely overlapped 
between cell types (P=0.8). We further examined 
which particular TFs had consensus binding sites 
enriched in the population of STAT6 target genes, 
although they were diluted in the surrounding 
regions of the STAT6 binding sites at which no tran- 
scriptional induction occurred. In BEAS2B cells, we 
found that the predicted binding sites of FAC1, 
which catalyses nucleosome sliding as a component 
of the NURF complex, 48,49 were enriched (P< 3e-5) 
in the former population. The expression of FAC1 
was induced in response to IL-4 by 23-fold in 
BEAS2B cells, whereas it was repressed by 50-fold in 
Ramos cells. On the other hand, in Ramos cells, the 
predicted binding sites of CEBP-7 were found to be 
enriched around the transcription-inducing STAT6 
binding sites (P<0.02), and transcription level of 



CEBP-7 was induced 2 7-fold upon IL-4 stimulation. 
These TFs may be responsible for the cell-specific 
regulation of STAT6 target genes. A complete list of 
the identified TFs in the respective cell types and 
their statistical significance is shown in 
Supplementary Fig. S8. These TFs should be the first 
targets in any attempt to elucidate the TFs that coop- 
eratively regulate the transcription of STAT6 targets in 
a cell-specific manner. 

3.7. Conclusions 

In this paper, we describe the characterization of 
STAT6 target genes in Ramos and BEAS2B cells using 
various types of massively parallel sequencing ana- 
lyses. Among the 44 and 132 STAT6 target genes 
identified in these two respective cell types, several 
previously uncharacterized target genes were 
included (Supplementary Table S2). The target 
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Figure 3. Status of histone modifications and pol II binding in the proximal regions of STAT6 target TSSs. (A) Distribution of the averaged 
ChIP Seq tags for H3K4me3 (top panels), H3Ac (middle panels) and pol II (bottom panels) in Ramos cells. Data from active target genes 
(STAT6 binding plus TSS induction in Ramos cells) are shown in the left panels, and data from silent target genes (STAT6 binding plus TSS 
induction in BEAS2B cells but both negative in Ramos cells) are shown in the right panels. Blue, green, red and purple lines indicate the 
results for the IP (IL-4 (+)), IP (IL-4 (-)),WCE (IL-4 (+)) and WCE (IL-4 (-)) experiments, respectively. On thex-axis, the position of the 
associated TSS is designated as zero. (B) Results of an analysis similar to that shown in (A) in BEAS2B cells. 
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Figure 3. Continued 



genes identified in this study reside in various func- 
tional categories in different cell types and expand 
the opportunities to treat IL-4-related allergic diseases 
by offering potential pharmacological targets. 

It should also be noted that putative alternative 
promoters were among the STAT6 targets identified 
in this study. Because the potential transcripts from 
these alternative promoters lack or have different N- 
terminal protein-coding regions in many cases (exem- 
plified in Fig. 1 B), they are likely to have diversified 
protein functions. Although elucidating the detailed 
biological roles of these targets will require further 
in-depth functional assays, it is likely that STAT6 also 
utilizes these previously poorly characterized tran- 
scripts in addition to canonical protein-coding tran- 
scripts for the fine tuning of target gene function. 
We also identified STAT6 target genes in intergenic 
regions of RefSeq transcripts that are likely to drive 
non-protein coding transcripts. Multi-layered regu- 
lation mediated by canonical promoters of protein- 
coding genes, alternative promoters and non-coding 



RNAs may further increase the complexity of the 
responses to STAT6-mediated IL-4 stimulation in 
various cell types, thereby collectively orchestrating 
immune responses in humans. One of the greatest 
advantages of integrating ChIP Seq data with TSS Seq 
data may be that previously uncharacterized mol- 
ecules could be analysed in a uniform platform. 

We also found that different cell types have different 
repertoires of STAT6 targets. In Ramos cells, genes 
mainly involved in protein secretion and protein traf- 
ficking (such as MS4A1 and CCR6 50 ) were identified 
as STAT6 targets, suggesting that one of the main bio- 
logical roles of STAT6 activation is to change patterns 
of secretory proteins and/or cell surface proteins 
during the maturation of B cells. On the other hand, 
in BEAS2B cells, genes involved in the alteration of 
transcriptional programs (such as PTRH2 51 and 
FUBP1 52 ) were found to be enriched. We further com- 
pared the newly identified STAT6 targets in Ramos 
and BEAS2B cells with recently identified STAT6 
targets in Th2 cells. We found that neither overlapped 



390 



Characterization of STAT Target Genes 



[Vol. 1 8, 



■ Ramos 
18EAS2B 



i,l„ lli„ 



r Y ■ ^ppi|i,ii,i.j 



B 

Ramos 



IL-4I 



IL-4 (+) 



Transcriptional Factor 

Active Target 

jjjl active chromatin 




additional transcriptional 



STAT6* factor 1 




induction 



BEAS2B 



IL-4(-) 



IL-4 (+) 



silent chromatin 



additional 



^factor 2^B^ 



Figure 4. Distinct regulation of STAT6 targets in different cell types. (A) Fold change of TF genes in response to IL-4 stimulation in Ramos 
cells (red bar) and BEAS2B cells (blue bar) evaluated by digital TSS-tag counts. TFs were ordered by their fold induction in Ramos cells on 
the x-axis. (B) Schematic representation of a model for distinct STAT6 activation in Ramos cells (upper panels) and BEAS2B cells (lower 
panels). Inferred chromatin status and pol II binding are shown for the active targets in the left panel and for the silent targets in the 
right panel. Green circles, STAT6; purple circles, pol II; yellow and orange circles, putative additional factors in Ramos and BEAS2B cells, 
respectively; left and right columns, active and silent chromatin, respectively. 



regarding functional categories, suggesting that STAT6 
targets bring distinct cellular consequences depend- 
ing on cell types (Fig. 2). 

We attempted to identify how different STAT6 tran- 
scriptional activation programmes are realized in 
different cell types. We found that the chromatin 
status and pol II binding patterns were different 
between Ramos and BEAS2B cells (Fig. 3). In BEAS2B 
cells, the formation of active chromatin and recruit- 
ment of pol II were specific to active STAT6 target 
sites. Pol II recruitment occurred only upon IL-4 
stimulation. In contrast, in Ramos cells, this event 
sometimes occurred even around silent STAT6 
targets, which are active in different cell types but 
silent in this cell type. Different cell types seem to 
have different modes of STAT6-mediated target acti- 
vation. We therefore searched for factors that may 
be necessary for full transcriptional activation of 
STAT6 targets in Ramos and BEAS2B cells. As putative 
cooperative factors for STAT6, we identified FAC1 
(which is a component of the nucleosome sliding 
complex NURF) in BEAS2B cells and CEBP-7 (which is 
a transcriptional activator 53 ) in Ramos cells. The 
identification of different categories of TFs in different 
cell types may indicate that transcriptional induction 
mediated by STAT6 requires modification of the chro- 
matin structure in BEAS2B cells, whereas the 



recruitment of synergetic TFs is more important in 
Ramos cells. Distinct cellular responses may be 
mediated by the intrinsic chromatin status and 
unique sets of TFs, and the factors that play the 
most important roles may be different depending on 
cell type. 

We also found that STAT6 binding did not always 
result in transcriptional activation. On the contrary, 
the targets that were transcriptionally activated 
upon STAT6 binding were relatively few in both 
Ramos and BEAS2B cells. This feature of STAT6 
binding was also observed in Th2 cells by other 
groups. 22 There seems to be complex regulation of 
transcriptional induction even after the formation of 
active chromatin and the recruitment of TFs. The 
advantage of using massively parallel sequencing as 
a common platform is that genome-wide data can 
be generated simultaneously for different steps of 
transcriptional activation, ranging from the binding 
of STAT6 and histone modification to pol II binding 
and transcriptional induction. It will be important to 
determine which regulatory step is the most impor- 
tant for each gene to fully understand STAT6- 
mediated transcriptional activation mechanisms. 
Further in-depth analyses of the identified target 
genes will also be necessary to understand the mech- 
anisms whereby these factors in their respective cell 
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types collectively induce necessary and/or harmful 
immune responses in healthy and diseased individ- 
uals. Biochemical and phenotypic analyses at the cel- 
lular and organismal levels are still beyond the reach 
of genome-wide approaches. However, the target 
gene catalogue identified and characterized in this 
study in different cell types should serve as a valuable 
resource. 
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